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Abstract — In this paper, the stability condition for low-density 
parity-check (LDPC) codes on the binary erasure channel (BEC) 
is extended to generalized LDPC (GLDPC) codes and doubly- 
generalized LDPC (D-GLDPC) codes. It is proved that, in both 
cases, the stability condition only involves the component codes 
with minimum distance 2. The stability condition for GLDPC 
codes is always expressed as an upper bound to the decoding 
threshold. This is not possible for D-GLDPC codes, unless all 
the generalized variable nodes have minimum distance at least 
3. Furthermore, a condition called derivative matching is defined 
in the paper. This condition is sufficient for a GLDPC or D- 
GLDPC code to achieve the stability condition with equality. 
If this condition is satisfied, the threshold of D-GLDPC codes 
(whose generalized variable nodes have all minimum distance at 
least 3) and GLDPC codes can be expressed in closed form. 

I. Introduction 

A traditional LDPC code [1] of length N and dimension 
K is graphically represented through a bipartite graph with N 
variable nodes and M > N — K check nodes [2]. A degree-n 
check node of an LDPC code can be interpreted as a length-n 
single parity-check (SPC) code, i.e. as a (n, n—1) linear block 
code, while a degree-n variable node can be interpreted as a 
length-n repetition code, i.e. as a (n, 1) linear block code. 

Doubly-generalized LDPC (D-GLDPC) codes, recently in- 
troduced in [3] [4], extend the concept of LDPC codes, by 
allowing some variable and check nodes to be generic linear 
block codes instead of repetition and SPC codes respectively. 
If all the variable nodes are repetition codes, then the code is a 
generalized LDPC (GLDPC) code [2], [5]-[8]. The codes used 
as variable and check nodes are called component codes of 
the D-GLDPC code; they will be supposed to have minimum 
distance d m - m > 2 [4]. The variable and check nodes which 
are not, respectively, repetition or SPC codes, are referred 
to as generalized nodes. The corresponding code structure 
is represented in Fig. [T] If Ny is the number of variable 
nodes, then the codeword length is N = X}j=i (with fcj 
dimension of the i-th variable node). An (n, k) generalized 
variable node is connected to n check nodes; k of the N D- 
GLDPC encoded bits are received by the generalized variable 
node, and interpreted as its k information bits. An (n, k) 
generalized check node is connected to n variable nodes. If 
Nc is the number of check nodes, then the number of parity 
check equations is M — 2~^tfi( n « — hi) (with fcj and 



n i ~ ^2 parity-check 
equations 




fci received 



Fig. 1. Structure of a D-GLDPC code. 

respectively, dimension and length of the i-th check node). The 
ensembles of variable and check nodes are called variable node 
decoder (VND) and check node decoder (CND), respectively. 
For a description of the D-GLDPC codes iterative decoder on 
the AWGN channel and the BEC, we refer to [3], [4]. 

For standard LDPC ensembles, an important role is played 
by an inequality known as stability condition [9], [10]. For 
transmission on a BEC with erasure probability q, the sta- 
bility condition establishes the following upper bound to the 
asymptotic threshold q* for the LDPC ensemble: 

^[A'toip'tir 1 . (i) 

In (Q]), A'(0) = A2 is the fraction of edges towards the 
length-2 repetition variable nodes, while p'(l) is the derivative 
(computed in x = 1) of the function p{x) = J2j>2 Pi^ 1 > 
where pj is the fraction of edges connected to SPC check 
nodes of length j. The bound (fl]i was first developed from 
density evolution. However, it is possible to interpret it in 
a simple graphical way, by exploiting EXIT charts [11]. 
More specifically, the stability condition is equivalent to the 
following statement: For q = q*, the derivative of the EXIT 
function Ie,v{Ia, <?) for the VND, with respect to and 
evaluated in I a = 1, must be smaller than the derivative of 
the inverse EXIT function I^} c {Ia) for the CND, evaluated 
in I a = 1, i-e. ([1]) is equivalent to 

dI E ,v(lA,q*)/dlA |/ A =i < dI E ^ c /dI A \ lA = 1 . (2) 

'Ia denotes the average a priori mutual information in input to the VND 
or to the CND. 



The LDPC stability condition ([l} is tight in that there exist 
LDPC distributions whose threshold achieves it with equality, 
assuming the closed form q* = [A'(0)//(1)] _1 . For achieving 
((TJ with equality, it is sufficient that the first occurrence of a 
tangency point between the EXIT function Ie.v{Ia, q) of the 
VND and the inverse EXIT function I e ] c {Ia) of the CND 
appears in I a = 1, i.e. 



I E y{\,q*)=I E ] c {\) = l 
dI E ,v(lA,q*)/dlA \l A =l = 



Ia = 1- 



(3) 



For LDPC codes, the first equality is always true. As proved 
in [4], it is always satisfied also for GLDPC and D-GLDPC 
codes, if all the variable and check component codes have 
dmin > 2, which is assumed true in this paper. Then, only the 
second equality will be considered in the sequel, and referred 
to as derivative matching condition. 

In this paper, the stability condition ©, and the derivative 
matching condition (O are extended to GLDPC and D-GLDPC 
codes. Two main results are obtained. The first is that only the 
component codes with drain = 2, including length-2 repetition 
codes and SPC codes, appear in the stability condition. The 
second is that, for GLDPC codes satisfying the derivative 
matching condition, it is always possible to develop a closed- 
form expression of the threshold; the same expression holds 
also for D-GLDPC codes satisfying the derivative matching 
condition, if all the generalized variable nodes have <i m i n > 3. 

II. Definitions and Basic Notation 

The transmission channel is a BEC with erasure probability 
q. Assuming a bipartite graph with random connections, the 
extrinsic channel, over which the messages are exchanged 
between the variable and check nodes, during the iterative 
decoding process, is modelled as a second BEC with erasure 
probability p [11] (depending on the decoding iteration). It is 
readily proved that I a = 1 — p: Since the EXIT functions 
will be expressed as functions of p (and q for the VND), the 
derivatives of the VND EXIT function and of the CND inverse 
EXIT function will be evaluated at p = (I a = 1)- 

Under the hypothesis of random bipartite graph, the VND 
and CND EXIT functions can be expressed, respectively, as 



Ie,v(p, q) = ^Xi Ie] v (p, q) 



L E,C 



(P) 



1c 



r(i) 
l E,C 



(P), 



(4) 



(5) 



where Xy and Iq are the number of variable and check node 
types, I EV {p,q) and I E c(p) are the EXIT function for the 
i-th variable node type and z-th check node type, Xi and pi 
are the fractions of edges towards the variable nodes of type 
i and the check nodes of type i. 

For the scope of this work it is useful to isolate, in the 
contribution of the repetition codes and, in (O, the contribution 



of the SPC codes: 

(rep) (gen) 

I E y{p, q)=J2 A 5 r) ' C 1 - IP 3 '') + E A * J Sv(P> «) 
j>2 i 

(rep) (gen) 



5>? ^ffv (p. «) (6) 

j>2 i 



(SPC) 

Ie.c(p) = Yl 

J>2 



(SPC) 

p) 



(gen) 

(i-p) j '- x + X>^ 



£ } c(p) 



(gen) 



Aspc(1 -p) + 



Pi^EAP)- 



(7) 



In ©, j is the length of the generic repetition variable node, 
X^ is the fraction of edges towards the repetition nodes of 



0) 



It uses the well 



length j, Xr(x) is defined as J2j>2 
known EXIT function expression on the BEC, I E (p, q) = 1 — 
qpP -1 , for a (j, 1) repetition variable node. The summation in 
i is over all the generalized variable node types. Analogously, 
in ©, j is the length of the generic SPC node, Pj SPC ' ) is the 
fraction of edges towards the SPC nodes of length j, pspc(x) 
is defined as J2j>2 Pj SPC ' > x^ 1 , and the expression I E (p) = 
(1 — p) 3 , valid for a (j ,j — 1) SPC check node, is used. 

The EXIT function for an (n, k) generalized variable node 
of a D-GLDPC code on the BEC can be expressed as 



-1 k 



i E (p,q) = i-- y_ yv,p*(i-p) 



(=0 z=0 



(8) 



expression which can be developed from [11, eq. 36], where 
a t , z = [{n - t)e n -t,k-z ~ (* + l)e n -t-i,k-*]- The parameter 
&g,h (with g = 0, . . . , n and h = 0, . . . , k) is the (g, h)-th 
un-normalized split information function, defined as explained 
next. Considering a representation of the generator matrix G 
for the (n, k) variable node, and appending to it the (k x k) 
identity matrix e g> h is equal to the summation of the ranks 
over all the possible submatrices obtained selecting g columns 
in G and h columns in We remark that the split information 
functions for a generalized variable node, and then its EXIT 
function, heavily depend on the code representation, i.e. on 
the chosen generator matrix [4]. Then, the performance of the 
overall D-GLDPC code depends on the code representation 
used at the generalized variable nodes. 

The EXIT function for a generalized (n, k) check node of 
a GLDPC or D-GLDPC code on the BEC can be obtained by 
letting q — » 1 in ^ (no communication channel is present). 
The obtained expression, equivalent to [11, eq. 40], is 



Ie(p) = 1 



1 



t=0 



(9) 



where, a t = (n — t) e n -t — (t + l)e„_t_i and where for 



.9 = 0,. 



is the g-th un-normalized information 



function of the (n, k) code, a concept first introduced in 



independent set 
of size j 



n- j 



row 

summations 



G': 



k — a 



A 





B 


C 



Fig. 2. If rank(G) = r and the first j columns are an independent set, then 
G can be transformed as shown by row summations only, with A, B and C 
non-null matrices. 



[12]. It is defined as the summation of the ranks over all 
the possible submatrices obtained selecting g columns from 
the generator matrix G. As opposed to the split information 
functions, the information functions are independent of the 
code representation. Thus, different code representations lead 
to the same EXIT function for the generalized check node. The 
performance of the overall D-GLDPC code is then independent 
of the specific representation of the generalized check nodes. 

Equations ([8]) and (O assume that MAP erasure correction 
is performed at the variable and check node. 

III. Independent Sets and Minimum Distance 

The development of a generalized stability condition for 
GLDPC and D-GLDPC codes is mostly based on a theorem 
proposed in this section. This theorem establishes a sufficient 
condition for a (k x (n — t)) binary matrix, obtained by 
selecting n — t columns in the generator matrix G (any 
representation) of a (n, k) linear block code, to have rank 
equal to k. The results presented in this section can be also 
developed from [12]. Here, they are independently proved and 
formulated for the scope of the paper. 

Definition 1 (independent set): Given a (k x n) binary ma- 
trix of rank r, a set of t columns is said an independent 
set when removing these t columns from the matrix leads 
to a (k x (n — i)) matrix with rank r — Ar < r, for some 
< Ar < t. The number t is the size of the independent set. 

If j columns are an independent set for a certain represen- 
tation of the generator matrix, they are an independent set for 
any other representation. Moreover, removing them from any 
representation of the generator matrix leads to the same rank 
reduction Ar. This is because all the possible representations 
of the generator matrix can be obtained by row summations 
starting from any other representation. 

Lemma 1: If any representation of the generator matrix of 
a (n, k) linear block code has an independent set of size j, 



then the code minimum distance satisfies d n 



< 



Proof: Let us suppose that j columns of a generator 
matrix G are an independent set of size j. Then, it must be 
possible to perform row summations on G in order to obtain 



a new generator matrix representation G', in which a certain 
number a of rows have all their l's in correspondence of only 
the columns of the independent set (see for example Fig. [2] 
where the first j columns are supposed an independent set, 
and where A, B and C are non-null matrices). Any of these 
a rows is a valid codeword. Then, d m i U < j. □ 

Theorem 1: Let G be any representation of the generator 
matrix of an (n, k) linear block code. Then, the following 
statements are equivalent: 

a) the code has minimum distance t; 

b) the minimum size of the independent sets of G is t. 
Proof: [a =>■ b] If d m i n = t, then it is possible to 

construct a representation of G where there is at least one row 
with exactly t l's. The columns of G corresponding to these t 
1 's are an independent set (of size <), because removing them 
from G leads to a reduction of the rank. This independent set 
must be of minimum size. In fact, if it existed an independent 
set of size j < t, then from Lemma[T]it would follow d min < t, 
thus violating the hypothesis d m j n = t. 

[b =>• a] Let us suppose that the minimum size of the 
independent sets of G is t, and let us consider an independent 
set of size t. From Lemma Q] it follows that d m in < t. The 
proof is completed by showing that it is not possible to have 
dmin < t. In fact, if d m j n = j < t then, by reasoning in the 
same way as for the [a =>• b] proof, it would follow that the 
minimum size of the independent sets of G is j < i, which 
violates the hypothesis. □ 

The present section is concluded by the following lemma (not 
necessary to prove Theorem Q}. 

Lemma 2: Let k < n, and t be the minimum size of the 
independent sets of a (k x n) binary matrix with rank r. Then, 
removing any independent set of size t, leads to a (k x (n — t)) 
matrix with rank r — 1. 

Proof: Since t is the minimum size of the independent 
sets of the matrix, then removing any set of j < t columns 
does not affect the rank. For an independent set of size t, 
one can remove any subset of t — 1 columns without reducing 
the rank; when removing the t-th column, the rank can only 
decrease by 1. □ 

IV. Stability Condition and Derivative Matching 
for GLDPC Codes 

In GLDPC codes, all the variable nodes are repetition codes. 
Recalling (O, and observing that in this case 2~2f>^ *f = 
1, the EXIT function on the BEC for the VND is" given by 
Ie,v{p,i) — 1 — qX^(x). Hence: 

dlE.y{p, q)/dp | p= o = -q*^ ] . (10) 

Recalling ©, the derivative of Ie,c(p) at p — is 

<1Ie,c(p) 



dp 



p=0 



(gen) 



(p) 



dp 



(ID 



In order to develop the previous expression, it is necessary to 
express the derivative of the EXIT function for the generalized 



check nodes. This task can be performed by exploiting the 
theorem proved in Section [TTTJ as explained next. 

Consider an (n, k) generalized check node, with EXIT 
function Ie(p) in the form ©. It is readily shown that the 
derivative of Ie(j>), computed at p = 0, is d/ ^ p ^ | p= o = 
(n-i)a n -ai _ According t0 Theorem[I] a = if and only if the 
generalized check node has minimum distance d m [ n > 2. In 
fact, the generator matrix of the check node is full rank (rank 
= k) by definition, so e n — k. Furthermore, from Theorem Q] 
removing any single column from the generator matrix does 
not reduce the rank if and only if <i m i n > 2, thus leading 
to e n -k — nk. Then, an. = ne n — e n -i = nk — nk = 0. 
As recalled in Section Q] the hypothesis d m i n > 2 is always 
assumed in this paper. Then, it will be always assumed an. = 0. 



If dmin > 2 for the check node, then 



dI E (p,q) 



dp 

where a% = (n — l)e„_i — 2 e„_2 = k n (n — 1) 
applying again Theorem Q] we obtain 



Ip=o : 

■ 2 6 r , 



«1 



= 

7^0 



if d n 
if d n 



> 3 
= 2. 



2- By 



(12) 



In fact, if d mm > 3, removing any pair of columns from the 
generator matrix does not affect the rank. In this case 2 e„_2 = 
2 k (2) = kn {n — 1), hence a\ = 0. 

According to these results, the only generalized check nodes 
that give some contribution to the summation in the second 
member of ( fTTT i are those characterized by d m ; n = 2. By 
recalling that all the SPC codes have minimum distance 2, we 
conclude that dlE ^ \ P =o only depends on the check nodes 
with d m i n = 2. The derivative at p = of the EXIT function 
for the CND can be then expressed as 

<He,c(p) 

kiTiiijli 



dp 

-PspcC 1 



E 



l)-2e nj _ 2 



-PspcW- E 



2 Pl 



A 



n-2i 



where the summation is over the generalized check node types 
with minimum distance 2. In the last equality, A^l 2 > is 
defined as follows. Let S rii -2 be the generic (ki x (rii — 2)) 
matrix obtained by selecting rii — 2 columns in the generator 
matrix. Then, A^l 2 = J2s _ — rank(5 ni _2)), where 
the summation is over all the possible matrices S ni ^2- The 
parameter A^i 2 does not depend on the chosen representation 
for the i-th generalized check node. 

The derivative at p — for the inverse EXIT function 
Ij$ c(p) of the CND is simply given by the inverse of 
(I13t . Then, the stability condition dlE,v{lA, Q*)/9Ia \i a =i < 



dI EC /dl 



A \I A = 1 



for GLDPC codes leads to 



< 



A 2 r) fp' S P C (l) 



'-min = ^ r, 
^ ^ 



(14) 



an upper bound on the threshold q* which represents a neces- 
sary condition for successful GLDPC (asymptotic) decoding. 

For GLDPC codes satisfying the derivative matching con- 
dition d3j (the first occurrence of a tangency point between 
lE,v(p,q) and /^^(p) appears at p — 0), the threshold 
assumes the following simple closed form: 



A 2 r) (V S Pc(l) 



2— m 



(15) 



If only generalized check nodes with d m j n > 3 are used, 
the stability condition ( TT4| ) and the threshold expression ( fT3b 
become, respectively, 

-1 



< 



^2 ^ PSPc(l) 



— ^2^PSPC(1) 



(16) 



(17) 



V. Stability Condition and Derivative Matching 
for D-GLDPC Codes 

The derivative at p — of the EXIT function for the CND 
of D-GLDPC codes is the same as for GLDPC codes, and is 
expressed by ( fT3l l. The derivative with respect to p, at p = 0, 
of the EXIT function for the VND is developed next. 

It follows from ©: 



dlE,v(p,q) 



dp 



p=0 



— -q A 2 



(gen) 



dl { £ v (p,q) 



dp 



(18) 



In order to develop the summation on the generalized variable 
node types in the second term, the derivative of the EXIT 
function for each generalized variable node type, with respect 
to p and at p = 0, can be computed based on J8j. By defining 

Sip) = Et=o at-zP* (1 ~ P) n ^\ it results 



(13) d! E (p, q) 



dp \p=o 



k 

= -;E« 2 (l-9) 

n — ^ 



k-AS{p) 



dp 



p=0 



= Yq*(l~q) k - z ("- 1 K'- a ^ ; (19) 
2=0 

where the fact df(p)/dp\ p= o = — (n — l)ao,z + &i,z h as been 
exploited. The previous relationship can be further developed 
by exploiting Theorem [1] Since, by hypothesis, any variable 
node has minimum distance at least 2, removing any single 
column from the generator matrix G of the code associated 
to the variable node does not affect the rank of G. It follows 
a-o.z = ne ni k- z -e n -i,k-z = kn ( fe *J -kn ( fc *J = 0, thus 

leading to ^g^| P=0 - - Eto ^ (1 ™ ^ ^f- 
Theorem [T] can be invoked again in order to show that 



a 1. 



= Vz 
^0 Vz 



if d„ 



> 3 



(20) 



2 If the derivatives are computed with respect to p - 
condition is 8Ie,v(p, Q*)/dp \ p =o > d/^^/dp \ p= 



1 — I a, the stability 



Some results about the stability condition on the BEC for GLDPC codes 
can be also found in [13, Appendix 7.A]. 



where c? m ; n is the minimum distance of the (n, k) code 
associated to the variable node under analysis. In fact, under 
the hypothesis c? m j n > 3, no independent sets of size 1 
and 2 are present in (any representation of) the generator 
matrix G. Then a M = (n - l)e„_i !fc _ 2 - 2e„_ 2 ,fc-z = 
kn(n-l)( k k _ z )-2k( n n _ 2 ) ( k k _ z ) = 0. 

It follows that the contribution of the generalized variable 
nodes to ?lE>^i£i3l | p _ Q j s o if they all have minimum distance 
greater than or equal to 3. The only non-null contribution 
comes from the generalized variable nodes with minimum 
distance d m i n = 2, which is coherent with the fact that, among 
the repetition codes, only those with d mnl — 2 (i.e. the length- 
2 repetition codes) give a non-null contribution. 

Then, ( fT8T > can be developed as 



dlE,v{p, q) 



dp 



-q A 2 



p=0 



"3 A 2 



dmin = 2 



E 9* 0- - D 



fci- 



z=0 



2e„.._ 



2,fe«-2 



rf min =2 



hi 

E 

z=0 



<f (1 - q) 



In the previous expression, A„l 2 i s defined in a similar 

way as A^l 2 for GLDPC codes. Let iS ni _2,fc;-z be the generic 
(ki x (rii — 2 + ki — z)) matrix obtained by selecting m — 2 
columns in the generator matrix, and ki — z columns in the 
(ki x ki) identity matrix. Then, A%'_ 2 k _ z = 2Zs„._ 2iSi ._ s ( fc '~ 
rank(<S nj _2,fej-z)), where the summation is over all the 
possible matrices iS ni _ 2l fe 4 _ z . Differently from A^_ 2 , the 
parameter A„i 2 k _ z depends on the code representation. 

By combining (fT3l and (fJTJ the stability condition 
dlE,v{p,q*)/dp\ p= o > dl]^ 1 c (p)/dp \ p= o assumes the form: 



fci- 



: 2A£ A (i) 



n — 2,k— z ' 



(21) 



A. 



dmi„=2 

E 



fci 

E 

z=0 



(?*)*(! "ST 



ki—z 



2A, A 



n— 2,k — z 



< 



Pspci 1 ) 



d„i„=2 



A 



(i) 



(22) 



This inequality is a necessary condition for (asymptotic) 
successful D-GLDPC decoding on the BEC. Differently from 
the GLDPC case, it is not possible to express this inequality as 
an upper bound on q*, because of the impossibility to factor q* 
from the summation in z. For D-GLDPC codes satisfying the 
derivative matching condition, d22l holds with equality. For 
the same reason, it does not lead to an explicit closed form 
expression of the threshold. However, if d m i n > 3 for all the 
generalized variable nodes, then any term of the summation 
over the generalized variable nodes is null. In this case, the 
same upper bound to q* as in (15[ holds: 

(Jmln=2 



< 



Ai r) U PC (l)+ E 



and this inequality is achieved with equality by D-GLDPC 
codes satisfying the derivative matching condition. Moreover, 



if rfmin > 3 also for all the generalized check nodes, then the 
upper bound on q* assumes the simple form as in dT6b : 

-l 



< 



A 2 ^ Pspc(I) 



This inequality is achieved with equality by D-GLDPC codes 
satisfying the derivative matching condition. 

VI. Conclusion 

In this paper, a stability condition on the BEC has been 
derived for GLPDC and D-GLDPC codes, generalizing the 
inequality q* < [A'(0)//(1)] _1 , valid for standard LDPC 
codes. A derivative matching condition has been also defined, 
sufficient to achieve the stability condition with equality. 

As for LDPC codes, only the variable and check nodes with 
minimum distance 2 are involved in the stability condition. The 
stability condition for GLDPC codes can be always explicitly 
expressed as an upper bound on the decoding threshold. D- 
GLDPC codes do not share this property in general; however, 
if all the generalized variable nodes have minimum distance 
at least 3, the stability condition becomes the same as for 
GLDPC codes. As a consequence, for GLDPC codes and for 
D-GLDPC codes with variable component codes of minimum 
distance at least 3 the decoding threshold assumes a simple 
closed form. 
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